CDS

Accession Number TCMCG068C54242
gbkey CDS
Protein Id KAG5622113.1
Location join(30629486..30629499,30629593..30629674,30629783..30629863,30630854..30630967,30631066..30631167,30632382..30632440,30634132..30634312,30634852..30634917,30634998..30635090)
Organism Solanum commersonii
locus_tag H5410_007331
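
Note: the Location field lists nine exon segments, and their combined length should agree with the protein length reported below. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates as in GenBank-style feature locations. The helper names `parse_join` and `spliced_length` are hypothetical and are not part of this record or of any database tool.

```python
import re

# Location string copied from the Location field of this record.
LOCATION = (
    "join(30629486..30629499,30629593..30629674,30629783..30629863,"
    "30630854..30630967,30631066..30631167,30632382..30632440,"
    "30634132..30634312,30634852..30634917,30634998..30635090)"
)

def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs from a join(...) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments: list[tuple[int, int]]) -> int:
    """Sum segment lengths, treating coordinates as 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)

segments = parse_join(LOCATION)
cds_length = spliced_length(segments)

print(len(segments))        # number of exon segments: 9
print(cds_length)           # spliced CDS length in nucleotides: 792
print(cds_length // 3 - 1)  # codons minus the stop codon: 263
```

The 792 nt total corresponds to 264 codons, that is, 263 amino acids plus one stop codon, which matches the Length field in the Protein section.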

Protein

Length 263 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA655804, BioSample:SAMN15755581
db_source JACXVP010000002.1
Definition hypothetical protein H5410_007331 [Solanum commersonii]
Locus_tag H5410_007331

EGGNOG-MAPPER Annotation

COG_category O
Description The proteasome is a multicatalytic proteinase complex which is characterized by its ability to cleave peptides with Arg, Phe, Tyr, Leu, and Glu adjacent to the leaving group at neutral or slightly basic pH
KEGG_TC -
KEGG_Module M00337, M00340
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko01000, ko01002, ko03051
KEGG_ko ko:K02729
EC 3.4.25.1
KEGG_Pathway ko03050, map03050
GOs -

Sequence

CDS:  
ATGTTTCTTACCAGAACTGAGTACGATAGAGGTGTTAACACCTTTTCCCCTGAAGGACGATTGTTTCAAGTTGAATATGCTATTGAAGCTATCAAGTTGGGTTCAACTGCAATTGGATTGAAGACTAAGGAAGGAGTTGTTCTTGCTGTGGAGAAGCGCATTACTTCACCACTTCTGGAGCCAAGCAGTGTGGAGAAAATTATGGAAATTGACGAGCATATTGGCTGTGCAATGAGTGGATTGATAGCTGATGCACGGACTCTTGTGGAACATGCACGGGTTGAAACTCAGAACCATAGATTCTCTTATGGTGAGCCCATGACTGTTGAGTCCACTACCCAAGCTCTCTGTGATTTGGCGTTGCGATTTGGTGAGGGCGATGAAGAATCTATGTCCAGGCCTTTTGGTGTGTCCCTTCTCATTGCTGGTCATGATGAGAACGGTCCCAGCTTGTACTATACTGATCCTTCTGGTACATTCTGGCAATGCAATGCTAAAGCTATTGGGTCAGGTTCTGAAGGTGCTGATAGCTCTTTGCAGGAGCAGTATAACAAGGTATCTAAGTTTTCTTATTTTTCTAGTGAAACCATGTCCTATAGTCCTGATTACGGTCCAGTCTGCAGCAGGCTGGAGGACCTTACCCTTAAAGAAGCTGAAACCATAGCACTGTCAATCCTTAAGCAAGTGATGGAAGAGAAGGTGACTCCCAATAATGTTGATATTGCAAGGGTATCTCCAACTTACCATCTATACTCACCATCAGAGGTGGAAGAGGTTATCAGCCGCCTATAA
Protein:  
MFLTRTEYDRGVNTFSPEGRLFQVEYAIEAIKLGSTAIGLKTKEGVVLAVEKRITSPLLEPSSVEKIMEIDEHIGCAMSGLIADARTLVEHARVETQNHRFSYGEPMTVESTTQALCDLALRFGEGDEESMSRPFGVSLLIAGHDENGPSLYYTDPSGTFWQCNAKAIGSGSEGADSSLQEQYNKVSKFSYFSSETMSYSPDYGPVCSRLEDLTLKEAETIALSILKQVMEEKVTPNNVDIARVSPTYHLYSPSEVEEVISRL
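
Note: the Protein sequence is the translation of the CDS sequence. The sketch below illustrates this on two short excerpts only (the first 30 and the last 12 nucleotides of the CDS above), assuming the standard genetic code (NCBI translation table 1). The `translate` function is a hypothetical helper written for this illustration; translating the full CDS works the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon, ignoring a trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# Excerpts copied from the CDS sequence of this record.
CDS_START = "ATGTTTCTTACCAGAACTGAGTACGATAGA"  # first 30 nt
CDS_END = "AGCCGCCTATAA"                       # last 12 nt

print(translate(CDS_START))  # MFLTRTEYDR
print(translate(CDS_END))    # SRL*
```

Both results agree with the ends of the Protein sequence: it begins with MFLTRTEYDR and ends with SRL, and the final TAA codon is the stop.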